SUSCEPTIBILITY IN SUBCRITICAL RANDOM GRAPHS 



SVANTE JANSON AND MALWINA J. LUCZAK 

Abstract. We study the evolution of the susceptibility in the subcrit- 
ical random graph G(n,p) as n tends to infinity. We obtain precise 
asymptotics of its expectation and variance, and show it obeys a law of 
large numbers. We also prove that the scaled fluctuations of the suscep- 
tibility around its deterministic limit converge to a Gaussian law. We 
further extend our results to higher moments of the component size of a 
random vertex, and prove that they are jointly asymptotically normal. 



1. Introduction 

The susceptibility x(G) of a graph G (deterministic or random) is defined 
as the mean size of the component containing a random vertex. (As is well 
known, for random graphs of the random-cluster model, this, or rather its 
expectation, corresponds to the magnetic susceptibility in Ising and Potts 
models.) If G has n vertices and components C%, .. . ,Ck, where K is the 
number of components, then thus 

x (G)=f2 m \C,\ = -E\C,\ 2 . (1.1) 

i=l i=l 

We define, for integers k > 1, 

K 

S k (G) :=£|C;|*. (1.2) 

i=i 

Thus x(G) = n~ 1 S2(G), and similarly n~ 1 S m +i is the rath moment of the 
size of the component containing a random vertex. (Note that by choosing 
a uniform random vertex, we bias the components by their sizes. The mean 
size of a uniformly chosen random component is n/K, which is different and 
which will not be treated here.) 

The purpose of this paper is to study x(G(n,p)), or equivalently S2(G(n,p)) 
for the standard Erdos-Renyi random graph G(n,p) with n vertices where 
each possible edge appears with probability p, independently of all other 
edges; we will also give extensions to Sk(G(n,p)) for larger k. 

We consider asymptotics as n — > oo, with p = p(n) a function of n. (All 
unspecified limits are as n — > oo.) 
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It is well-known, see e.g. Bollobas [2] and Janson, Luczak and Ruciriski 
[IH, that if np is a little larger than 1, np — 1 3> ra -1 / 3 to be precise, then 
G(n,p) has w.h.p. a giant component which is much larger than the others 
(the supercritical case). It is easily seen that then the giant component will 
dominate all other terms in the sum (|1.2j) ; hence, if the largest component is 
d, then S k (G(n,p)) = (1 + Op(l))|Ci|* and X (G(n,p)) = (1 + o p (l))|C 1 \ 2 /n. 
See Appendix |A] for a more precise statement (and proof). 

Similarly, if np = 1 + 0(n -1 / 3 ) (the critical case), then there are several 
components of the order n 2 / 3 ; in this case S k will be of order n 2fc / 3 , and 
thus x °f order n 1 / 3 , and it follows from Aldous [l[ that these quantities, 
properly normalized, converge in distribution to some random variables but 
not to constants. See Appendix [B] for details. 

In this paper we therefore concentrate on the case np < 1, and in particu- 
lar l — np^> n -1 / 3 (the subcritical case). We will prove the following results 
for x(G(n,p)), together with similar results for Sk(G(n, p)) stated later. 

We use O p and o p in the standard sense, see e.g. [ll|, pp. 10-11], and 

write X n ~ p a n for X n = a n + o p (a n ) or, equivalently, X n /a n — 1. We 
will also write X n = OLp(a n ) if ||A n |ji P := (ElX^) 1 ^ = 0(a n ), and, 
similarly, X n = oi,p(a n ) if ||X n ||iP = o(a n ). (Here, X n and a n are sequences 
of random variables and positive numbers.) 

Theorem 1.1. Uniformly, for all n > 1 and < p < n , 

E ^rt) = r^( 1+0 (^ ? ))' < L3 > 

Var.*(G(n,p)) = 0< „ 1 A (1.4) 



n(l — npY 



and 



X (G(n,p)) = r L- (l + O p ((n(l-np) 3 )- 1/2 )). (1.5) 

/n particular, if 1 - np > n -1 / 3 , t/ien x{G(n,p)) ~ p 1/(1 — np). 

One way to handle to explosion at p = 1/n is to consider 1/Ex or 1/%. 
In this form we can obtain uniform estimates for all p. 

Corollary 1.2. Uniformly, for all n > 1 and < p < 1, 

' (1 -np)+ + 0(n- 1 / 3 ), (1.6) 

x(g ( „,p)) (i-»rt + + o,(»- ,/3 ). (") 

The last statement of Theorem 11.11 can be sharpened to asymptotic nor- 
mality. We will also find the variance more precisely. We write X n ~ 
AsN( / u n ,<7 2 ) if (X n ) is a sequence of random variables and fj, n and o n > 

are real numbers such that (X n — fi n )/a n —* N(0, 1). 



Ex(G(n,p)) 
1 
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Theorem 1.3. If p = p(n) < n 1 and further 1 — np S> n 1 / 3 , i/ien 
x(G(„,rt) ~ A 8 N( r i^, 

and Var x{G{n,p)) ~ 2»/ (1 — np) 5 . 

It follows easily from x(G(n,p)) > that the asymptotic normality in 
Theorem 11.31 cannot hold for 1 — np = 0(n _1//3 ). 

The proof of Theorem 11.11 (given in Sections [3HH) is fairly simple and 
is based on studying how evolves for the Erdos-Renyi random graph 
process Q(n,t) (defined in Section [2]). Heuristically, it is easy to see that 
(ignoring the difference between a random variable and its mean), Sk ought 
to be an approximative solution to the differential equation f'{t) = f 2 (t), 
which (with the initial value /(0) = n) is solved by f(t) = n/(l — nt). 
We make this precise and rigorous below. This simple idea has presumably 
been noticed by several people, and at least the leading terms in (|1.3j) and 
(|1.5p are more or less known folk theorems. However, we do not know of 
any rigorous treatments, except [13] which uses the susceptibility to study 
a class of more complicated random graph process. Their processes include 
the Erdos-Renyi process studied here, so their results include the leading 
term asymptotics in f 1 1 . 3 [) and (|1.5j) in the case where p < (l — e)/n for some 
constant e > 0. Their analysis involves branching processes approximation, 
as well as differential equations, and seems contingent on the fact that the 
component distribution (excluding the giant in the supercritical case) has 
exponentially decaying tails. 

The proof of Theorem 11.31 is more involved; the asymptotic normality is 
based on using a martingale central limit theorem for a suitable modification 
of the process Sk(G(n, t)) (Section[5]), while the variance is estimated directly 
(Section [6]). 

In Section the asymptotic results for Sk are interpreted using the Borel 
distribution and its moments. 

Remark 1.4. It is seen from Theorem 11.11 that the susceptibility blows up 
at p = 1/n, which of course is another sign of the phase transition there, with 
the emergence of a giant component. In fact, our results give a new proof 
that there is no giant component for smaller p. In the opposite direction, 
the explosion of the susceptibility at (or close to) p = 1/n shows that there 
are large components at that stage; it is tempting to conclude that a giant 
component emerges around this instance (as we know by other argumants), 
but a formal proof based on this seems to require some additional work. See 



Spencer and Wormald [17( where this type of arguments is used for a class 



of more complicated random graph processes. 

Remark 1.5. An alternative approach to at least some of our results is 
to use the standard branching process approximation of the neighbourhood 
exploration process; this will be treated elsewhere. 
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Remark 1.6. In this paper we study the random graph G(n,p). Most or 
all of our results transfer easily to the random graph G(n, m) with a fixed 
number of edges by monotonicity (Lemma I2.ip and the standard device 
of coupling G(n,m) with G(n,p) for a suitable p such that the expected 
number of edges is slightly smaller or larger than m. We leave the details 
to the reader. 

Acknowledgement. This work was initiated during the programme "Com- 
binatorics and Statistical Mechanics" at the Isaac Newton Institute, Cam- 
bridge, 2008, where SJ was supported by a Microsoft fellowship. 

2. Preliminaries 

We first note a simple monotonicity. 

Lemma 2.1. If H is a subgraph of G, then S k (H) < Sk(G) for every k > 1. 

Proof. It suffices to consider the case when G is obtained from H by either 
adding a single edge or adding a single vertx (and no edges) ; both cases are 
immediate. □ 

The random graph process Q(n,t) starts at t = with n vertices and 
no edges, and where edges are added randomly and independently to every 
possible pair of vertices with rate 1, i.e., the time edge ij is added has 
an exponential distribution with mean 1. Hence, at a given time t, each 
possible edge is present with probability 1 — e~*, so G(n, t) is a random graph 
G(n, 1 — e _< ). We are interested in the subcritical case where t < 1/n; then 
the difference between 1 — e~ l and t is 0(t 2 ) = 0(n~ 2 ) which is negligible, 
and we can see Q(n,t) as a convenient version of G(n,t). More precisely, 
G(n,p) can be obtained as G(n, — log(l — p)); this slight reparametrization 
is annoying but harmless, and it will be convenient in the proofs below. 

We write Sk(t) for Sk{G(n,t)). (These and other quantities introduced 
below depend on n, but we choose not to show this explicitly in the notation.) 

We further define, for a graph G with components and k, I > 1, 

S k ,i(G) ■= \ C *\ k \ C i\ l = S k (Cf)Si(G) - S k+l (G). (2.1) 

We write S kj i(t) for S kt i(Q(n,t)). 

3. The expectation 

We may and will assume that the edges are added to Q{n,t) at distinct 
times. If a new edge joins two different components C{ and Cj in Q(n,t), 
then Sk(t) increases by a jump 

AS k (t) = (|C| + \C 3 \f - \C t \ k - |C/ = (f) WW' 1 - (3-1) 
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For each unordered pair the intensity of such jumps equals the num- 

ber of possible edges joining the two components, i.e. |Cj[|Cj|. We consider 
ordered pairs of components and therefore divide this by 2, and summing 
over all pairs we find that the drift of Sk(t) is 

V k (t) := E^H C ;I E (/) \^\%\ k ~ l = Yl\ (*) Si + x, k+ i-i(t); (3.2) 
in other words, noting that S k (0) = n, 

M k (t) := S k {t)-n- [ V k {u) du (3.3) 



Jo 

is a martingale on [0, oo) with M k (0) = 0. (Note that M k {t) is bounded for 
each fixed n and t in a finite interval [0, T]; hence, there are no problems with 
integr ability of this martingale. The same holds for all similar martingales 
below.) 

We define s k (t) := ES k (t), noting that s k (0) = n, and conclude from the 
martingale property that EMj(t) = EM k (0) = and thus 

s k (t)=ES k (t) = n+ [ EV k {u)du. (3.4) 
Jo 

In order to use this, we need information on ES k j(t). 

Lemma 3.1. For all k,l > 1; 

(i) ESy(i) < s k {t)si{t), 

(ii) ES k ,i(t) > s k (t)si(t)-s k+l (t). 

Proof, (i): Let A n be the set of all non-empty subsets of [n]. If A € A n , let 
-Ta(£) := is a component of Q(n,i)]. Thus, 

s k (t) = \ A \ kJ A(t) 

and, since I A I B = if A n 5 / but A / 5, 
S M (i) = ^ l^l fc l s l ,J A(t)/B(t) = E I^I^W E (3-5) 

A^B AeA„ BC[n]\A 

Conditioned on I A (t) = 1, the conditional distribution of the restriction of 
Q(n, t) to is a random graph with the same distribution as G(n— \A\, t), 

apart from a relabelling of the vertices. Hence, using also Lemma 12. 11 

E( Y, \B\ l lB(t)\l A (t) = l^=ES l (g{n-\A\,t))<ES l (g(n,t)) = s l (t). 

BC[n]\A 

Consequently, taking the expectation in (|3.5p yields 

ES kjl (t) < E Y, \A\ k I A (t) Sl (t) = s k (t) Sl (t). 
AeAn 
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(ii): By (JZH), 

ES k>l {t) =E(S k (t)S l {t)) -s k+l (t), 

and it remains to show that E(S k (t)Si(t)) > Sk{t)si(t), i.e., that Sk(t) and 
Si(t) are positively correlated. This follows by Harris' inequality (a spe- 
cial case of the FKG inequality) , since S k (t) and Si (t) are (by Lemma 12. ip 
increasing functions of the edge indicators of Q(n, t), and these are indepen- 
dent. □ 



We use this first to find an upper bound for s k (t). Combining (I3.4p . (|3.2 
and Lemma l3.1( i). we find 



k — l 

s' k (t) = EV k (t)<J2l( 
1=1 ^ 



k ~ l 1 fk y 

)si+x(t)a k -i+\(t). (3.6) 



The first cases are 



s' 2 (t) < s 2 (t) 2 , (3.7) 

4(t) < 3s 2 (t)s 3 (t), (3.8) 

a'S) < 4a 2 (t)a4(t) + 3s 3 {t) 2 . (3.9) 

Integrating (|3.7p . with the initial value £2(0) = we find, e.g. via 
(l/s 2 (t))' > -1 and thus l/s 2 (t) > 1/n - i, 

s 2 (<) < z 1, < t < 1/n. (3.10) 

1 — TO 



Next, ([3S1) and (I3TTU]) yield ((1 - nt) 3 s 3 (t))' < and thus, since s 3 (0) 



n 



aa(t) < (1 _ raf)3 » 0<i<l/n. (3.11) 

We can continue recursively and obtain the following bounds. 

Lemma 3.2. For every k > 2, there exists a constant C k such that, for all 
n, 

Tl 

®S k (t) = s k (t)<C k k _ 3 , 0<t<l/n. 

Proof. We have proven this for k = 2 and 3. For k > 4 we use induction and 
assume that the lemma holds for smaller values of k; then (|3.6|) yields, for 
some constants C' k and C' k ', taking the terms I = 1 and I = k — l separately 
and using (|3.10p . 

k—2 

' u\ i n\ u\ , m Ci+\n C k _i +1 n 
s k {t) < ks 2 (t)s k (t) + ^ C k ^_ nt)2l _ 1 ^ i _ nt)2k _ 2l _ l 

kn . . C'ln 2 
< ~i ~s k (t) + 



\-nt KKJ (l-nt) 2k - 2 ' 
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Hence, ((1 - nt) k s k {t))' < C£n 2 (l - nt)-^ k ~^ and thus 



(l-nt) k s k (t)<n + ^ dn<n + 



(fc - 3)(1 -nt) k - 3 



C k n ,_, 

< □ 

~ (l-nt) k ~ 3 ' 

We write the estimate in Lemma 13.21 as s k (t) = 0(n(l - nt) 3 " 2A; ) where, 
as in all similar estimates below, the implicit constant may depend on k 
(and later sometimes /), but not on n or t (in the given range < t < 1/n). 

We can now use this upper bound in a more or less repetition of the same 
argument to obtain more precise estimates. By Lemmas 13.11 and 13.21 for 
< t < 1/n, 

E-SV j(t) = s k (t) Sl {t) + 0(s k+l (t)) = s k (t) Sl (t) + 



(1 - nt) 2k + 2l -3 
Hence, flM]) and {32} yield 

= e v k (t) = J2 \ (*) (*) + Q ( (1 _ n " )2fc+ i ) ■ ( 3 - 12 ) 

The first cases are 

s' 2 (t) = s 2 (t) 2 + 0(n(l - nt)- 5 ) , (3.13) 

4(t) = 3s 2 (t)s 3 (t) + 0(n(l -nt)" 7 ), (3.14) 

«4(t) = 4s 2 (t)*4(*) + 3s 3 (0 2 + 0(n(l - nt)" 9 ). (3.15) 
We first treat s 2 (t). 

Theorem 3.3. 

Proof. Let T := inf{t : (1 - nt)s 2 (t) = n/2}. Since f(t) := (1 - nt)s 2 (t) 
is continuous with /(0) = n and /(1/n) = 0, then < T < 1/n and for 
< t < T we have s 2 (i) > \nj{\ — nt) and thus, by (|3.13p . 

1 -!- ()\ 1 - - I +() 



,s 2 (t) / Vl 1 - nt) 5 s 2 (t) 2 / V n ( 1_nt ) 3 /' 

This implies, recalling s 2 (0) = n and noting that J" *(l — nu) -3 dn = 0(t/(l — 
nt) 2 ) (which is, like similar integrals below, perhaps simplest seen by con- 
sidering the cases nt < 1/2 and nt > 1/2 separately), 

1 1 /"V 1 V i 1 

+ / — r t dn = - - t + O 



s 2 (t) n ' J \s 2 (u)J n Vn(l — nt) 2 

— nt / / nt 

(l + Of-z; vj 

n V \n(l — nt) d 



1 — nt / / nt . , 
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Taking here t = T, we find 1 = 0(l/(ra(l — nT) 3 )) , and thus n(l — nT) 3 = 
0(1) or 1 - nT = 0{n- 1 / 3 ). Choosing A large enough, we see that if 
1 — nt > An" 1 / 3 , then t <T, and further the O term in (|3.16p is, in absolute 
value, less than 1/2. Thus (|3.16p yields the result for 1 — nt > An~ 1 / 3 . The 
result for 1 — nt < An~ l l 3 follows trivially from the bound (|3.10p . □ 

Theorem 13.31 proves (|1 .3|) by the change of variable t = — log(l — p) = 
p + 0(p 2 ) as discussed in Section [21 noting that the result is utterly trivial 
for 1 — np = 0(n _1 ). 

We continue with higher k. 

Theorem 3.4. The following holds for < t < 1/n. 
ES 3 (t) = s 3 (t) = - n J l + O f "' 



(l-nt) 3 \ \n(l-nt) 3 < 
ni3 - 2(1 - nt)) / / nt 



n\6 — zi i — ni i / / 
ES , 4 (t) = s 4 t = V - V ,- " (l + 0( — 
w w (1 - nt) 5 V \n(l 



(1-nt) 3 

More generally, for every k > 2 there exists a polynomial pt of degree 2k — 3 
such that 

ES k (t) = s h (t) = npJ——)+o( "' 



1 — nt' v(l — nt 



\2k 



= "«(l^)( 1 + °(^W))- (3 ' 17 > 

We have P2{x) = x, p$(x) = x 3 , Pa(x) = 3x 5 — 2x A . In general, for k > 3, 
Pk( x ) = x qk(%) f° r a polynomial qu{x) of degree k — 3 that is recursively 
defined by (j[fc(l) = 1 and 

1 k ~ 2 fk\ 

l'k( x ) = 2 X/ ( 7 ) Ql+i(x)q k ~l+i(x), k>3. (3.18) 
1=2 ^ ' 

Equivalently, Pfc(l) = 1 and 

^( x ) = ^fE ; Wi(x)p fc _ m (x), fc > 2. (3.19) 
i=i ^ ' 



A probabilistic interpretation of Pk{x) and a simpler recursion formula 
are given in Section [71 The polynomials pf. for small k are given in Table [TJ 



Proof. We have shown the result for k = 2, with P2(x) = x which satisfies 
(|3.19p . For larger k, we use induction and assume that (|3.17p is true for 
smaller values of k. Then, by (I3.12p . taking the terms I = 1 and I = k — 1 
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p 2 {x) = X, 

p 3 (x) = x 3 , 

Pi(x) = 3x 5 — 2x 4 , 

p 5 (x) = 15 x 7 - 20 x 6 + 6x 5 , 

p 6 (x) = 105x 9 - 210x 8 + 130x 7 - 24x 6 , 

p 7 (x) = 945 x 11 - 2520 x 10 + 2380 x 9 - 924 x 8 + 120 x 7 , 

p 8 (x) = 10395 x 13 - 34650 x 12 + 44100 x 11 - 26432 x 10 + 7308 x 9 - 720 x 8 

Table 1. The polynomials Pk{x) for k < 8. 
separately, and (|3.18|) . 

k-2 / T \ 

n 



— 1 ( k\ 

4(t) = ks 2 (t)s k (t) +J2 o ( I Wi(t)«ft-H-i(*) + 0[— 

1=2 ^ ' ^ 



nt )2k+l 



kn 2 t-^ 1 fk\ ( 1 \ / 1 

Mt)+n Pl+1 V— t ) Pk - l+1 



1-nt w ^ 2\l VI -rat/ \l-nt 

1=2 

+ 0' " 



(1 - nt) 2 ^ 1 



kn , , n 2 . / 1 \ _/ n 

Mt) + 77—77777+2 ^ 7—77 + 



1-rai ftW (l-nt) fc + 2y Hl-nt; V (1 - nt) 2 ^ 1 / ' 



Thus, 



dt \l-ntJ \(l-nt) k+1 , 

The result follows by integration, recalling that Sjt(0) = n. For the second 
form in (|3.17p . with the error term written multiplicatively, we note also 
that it follows from the recursion (|3.18p that p^ has degree 2k — 3 with a 
positive leading term; since further and p k are non-decreasing on [1, do), 
for example by (|3.18p again, and thus strictly positive there, it follows that 
Pk(x) x x 2fc_3 for x > 1. □ 

4. The variance 
Theorem 4.1. For every k > 2, Var(S*fc(t)) < S2fc(i). Hence, 

Var(S fc (t)) = 0(n(l - ni)- (4fc_3) ), < t < 1/n. 
Proof. By (|2.ip and Lemma l3.1f i). 

E(S fc (f) 2 ) = E%(t)+E&(t) < (ES fc (t)) 2 + s 2fc (t). 
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The final estimate follows by Lemma 13.21 □ 

A more precise result will be given in Section [6l This will show that the 
bound in Theorem 14. II is of the right order as long as 1 — nt ^> n -1 / 3 . 

Corollary 4.2. If 1 — nt 3> ra -1 / 3 , then S k (t) ~ p np k ( jj^ J f or every 
k>2. 

Proof. By Theorem l3.4| ESV(t) ~ n P t '(r^m) - Further. Theorems l4.1l and l3.4l 
show that 

Var(S fe (i)) /nO-nt) 3 ^^ ^ 1 \ 

" ° TT^TT = O -7- — 3 = 0(1), 



(E5 fc (t)) 2 \n 2 (l - nt)*- 4 * J \n{l - nt)'\ 
and the result follows by Chebyshev's inequality. □ 



Proof of Theorem \l.l\ As remarked above, (jl.3p follows from Theorem! 
Similarly, the case k > 2 of Theorem 14.11 yields (|1.4p . Together, these 
estimates yield fjl .51) for 1 — np > n -1 / 3 ; in the remaining case < 1 — np < 
n -1 / 3 , (|1.5p follows trivially from the estimate Ex(G(n,j))) < 1/(1 — np), 
which follows from Lemma 13.21 provided 1 — np > 1/n, and otherwise from 
the trivial x{G(n,p)) < n. □ 

Proof of Corollary \1.SX Let A > be so large that the O term in (jl.3p is 
< 1/2 for 1 - np > An' 1 ? 3 . Then ((OJ yields, for np < 1 - ^n" 1 ^ 

E x(G(n. P )) + °l-"^0("- 1/3 ). 

which shows (jl.6p for these p. In particular, for np = 1 — ^4n -1 / 3 we find 
l/E%(G(n,p)) = 0(n _1//3 ). This, and thus (jl.6p . then holds for all larger p 
too by monotonicity (Lemma l2.ip . 

The proof of (|1.7I) is similar, using (|1.5p . □ 

5. Asymptotic normality 
The quadratic variation of the martingale M k (t) is 

[M k ,M k ] t := ]T AM k (u) 2 = £ AS fc (u) 2 , 

0<U<t 0<M<t 

where AJ(s) := A(s) — A(s— ) denotes the jump (if any) of a process X at s. 
(This formula holds because M k is a martingale with paths of finite variation 
and M k (0) = 0; see e.g. [6] for a definition for general (semi)martingales.) 
Using (|3.ip . we find, in analogy with (|3,2p . that [M&,Mfc]t has drift 

\ 2 



E E 2 ( Z ) (m ) Sl+m+1 > 2k+i-l-m(t); (5.1) 

l_1 m — 1 \ / \ / 



=1 m=l 
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i.e., [M k , M k ]t — §q W k (u) du is a martingale. 

It turns out to be advantageous to work with a slightly different martin- 
gale. In order to cancel some terms later on, we multiply S k (t) by (1 — nt) k 
(cf. the proof of Theorem 13.41 where we did the same with the expectation 
in order to simplify the differential equation); we thus define 

S k (t) :=(l-nt) k S k (t), (5.2) 

which (by a simple instance of Ito's formula) has the drift 

V k (t) := (1 - nt) k V k (t) - kn(l - nt^S^t). (5.3) 

Thus, 

M k (t) := S k (t) - n - f V k (u) du (5.4) 
J o 

is a martingale with M k (0) = 0. The quadratic variation is 
[M k ,M k ] t := Y &M k (u) 2 = Y &S k (u) 2 = Y (l-nu) 2k AS k (u) 2 . 

0<u<t 0<u<t 0<u<t 

This has drift 

W k (t):=(l-nt) 2k W k (t), (5.5) 

and thus 

J0fc(t) := [M k ,M k ]t- I W k {u)du (5.6) 
J o 

is another martingale with M k (0) = 0. 

We repeat the argument and find that M k has quadratic variation 



[jfr fc A] t := Y ^k(uf= Y (A[M k ,M k ] u ) 2 = Y, A ^k( 

0<u<t 0<u<t 0<u<t 

Y (l-nu) 4k AS k (u)\ 



u) 4 



0<u<t 

which has drift, in analogy with (13. 2\\ and (15, ip 



\k-l 



i^j \z=i V y 

«l,«2,i3,U=l »=1 V *' 

thus, [Mfc, Mfc]t — J*q W / fc( M ) du is yet another martingale which starts at 0. 
Assume in the remainder of the section that 1 — nt > n -1 / 3 , i.e. 

< t < n' 1 - ra~ 4/3 . (5.8) 
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(Although some estimates require only < t < 1/n.) By Lemmas 13. l( i) and 
[331 for any k, I > 2, 

,,2 

ES k At) = 



n 



(1 - nt) 



2fc+2/-6 



Hence, (|5.7p yields 

EW k (t) = o({l-nt) 4k 



n~ 



O 



n 



1 - nt) ik - 2 J ' 



(l-nt) 8fe - 2 , 

Since Var(M(t)) = EM 2 = E[M, M]t for every square integrable martingale 
with M(0) = 0, 



t _ 



E(M fe (i))' = E[M fe ,M fc ]i = E / W fc (n)du = O 
= O 



n 



(1 - nn) 4fc " 2 



n 2 i 



(l_ nt )4fc-3 

We define, subtracting by (|3,17p an approximation to the mean, 
Y k {t) :=S k (t)-n Pk (^-—^. 
Lemma 5.1. For every k > 2 and 1 — nt > n -1 / 3 , 

n l/2 



du 
(5.9) 

(5.10) 



(1 _ n t)2fc-3/2 



Proof. 



\\Y k (t)\\ z L2 =V a rS k (t)+ ES k (t)-np k (- 



nt 



and the result follows by Theorems 14.11 and 13.41 using n(l — nt) 3 > 1. 
Lemma 5.2. For every k,l > 2 and 1 — ni > n _1//3 , 



□ 



Pi 



1 



+ o 7 



n 



3/2 



(1 - nt) 2k + 2l - g / 2 



.1 — nt/ V 1 — nt 
Proof. By ([21]) and ([5TTUD . 

and thus, using Lemmas 15.11 and 13.21 and the Cauchy-Schwarz inequality, 

1 



npi 



_L_) +1 . l(() )_ St+l(() 



S k ,i(t) - n 2 p k {—— 



nt 



in 



l-nt 



L 1 



= 0(n 3 / 2 (l - nt)- 2k - 2l+9 / 2 + n(l - nt)- 2k ~ 2l+z ) , 
which yields the result by our assumption n(l — nt) 3 > 1. 



□ 
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Lemma 5.3. For every k > 2, there exists a polynomial P k of degree 2k — 2 
given by 

fc-l fc-i 



P k (x) = x 2k Yl \ ( i) ( k JPl+m+l{x)p2k+l-l-m(x) 

1=1 m=l ^ ' ^ ' 
Yl Y \ (jj y^j ( ll+rn+lix)q2k+l-l-m(x) (5.11) 



X 

1=1 771=1 



smc/i that, for 1 — nt > n x / 3 , 



/ 1 \ f ?7 3 / 2 

T^(t)=n 2 PJ- -)+Oj 



A-ntJ n V(l " nt) 2fc -V2 

Proof. An immediate consequence of (|5,5p . (|5.1|) and Lemma I5.2I □ 

Lemma 5.4. (i) For every k > 2, there exists a polynomial Q k of degree 
2k — 3 given by 

Q' k (x) = x- 2 P k (x), (5.12) 
with Q k (l) = 0, such that, for 1 — nt > n -1 / 3 , 

~ / 1 \ / nt 1 / 2 \ 

[M t , M t ], = n Qk (— ) + L1 ( (1 _ j . (5.13) 

(ii) If n 2 t — > oo and n(l — nt) 3 — > oo, t/ien 

[M fe ,M fc ]t = nQ fc (^-^) (1 + o L i(l)) = n Q h {j[~ n ~t) 0- + °p( 1 ))' 



Proo/. (i): By (|5^]h Lemma and (Q, 
[Mfc,M fc ]t= / iy fe (n)du + ^ fe (t) 

JO 



2 ~ / 1 \ /n 3 / 2 t + ntV2 

n 2 P k [- )du + L J 



A-nuJ h V(l -nt) 2fe ~ 3 / 2 , 

and (|5.13p follows, noting that n 3 / 2 t < nt 1 / 2 . 

(ii): By (|5.12p . Q k is increasing for x > 1, and thus non-zero, and it 
follows that Qk( i^ nt ) X nt(l — nt) 3_2fe . It remains only to verify that 

nt 1 / 2 (l — nt) 3//2 = o(n 2 t(l — nt) 3 ), which is obvious under our conditions if 
we consider nt < 1/2 and nt > 1/2 separately. □ 

We will use the following general result based on Q; see [1, Proposition 
9.1] for a detailed proof. (See also 0], [1] and [nj for similar versions.) 

Proposition 5.5. Assume that for each n, M^ n \x) is a martingale on [0, 1] 
with MW(0) = 0, and that a 2 (x), x G [0, 1], is a (non-random) continuous 
function such that for every fixed x S [0,1], 

[iW.lWj^^fi) asn^oo, (5.14) 
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supE[M( n >,MW] x <oo. (5.15) 

n 

Then AfW M as n — » oo, in D[0,1], where M is a continuous q- 
dimensional Gaussian martingale with EM(x) = and covariances 

E(M(x)M(y)) = a 2 (x), 0<x<y<l. 
In particular, M^(l) iV(0, cr 2 (l)). 

Remark 5.6. Proposition 15.51 extends to vector- valued martingales; see the 
versions in 0; H3] . 

Remark 5.7. The versions in 0; [l(| are for martingales on [0, oo); it is 
easily seen that the versions are equivalent by stopping the martingales at 
a fixed time; moreover, by a (deterministic) change of time, we may replace 
[0, 1] by any closed or half-open interval [a, b] or [a, b) C [— oo, oo]. 

Further, (15, 15ft is equivalent to sup n E|M( n )(x)| 2 < oo, the form used in 
e.g. 0. 

Lemma 5.8. If n 2 t — > oo and n(l — nt) 3 — ► oo, then 
M k (t) ~ AsN(o,nQ fc ( ' 



.1 - nt, 

Proof. In order to apply Proposition 15.51 we have to change the time scale 
to a fixed interval so that the quadratic variation converges. By considering 
subsequences, we may assume that nt — > a for some a G [0,1]. We then 
define M^ n \x) for x G [0, 1] as follows. 

(i) If < a < 1, we let M^ n \x) := (nH)' 1 / 2 M k (xt), and see that 
Lemma loTiT ii) implies (15.14j) with a 2 (x) = a _1 Qfc(l/(l — ax)). 

(ii) If a = 0, we define AfW (x) in the same way, and find now that 
Lemma EHii) implies (I5.14|) with a 2 (x) = xQ' k {\). 

(iii) If a = 1, we let M^ n \x) := n" 1 / 2 ^ - nt) k - 3 / 2 M k (t n (x)), where 



t n (x) :-- 



n \ x 



x < 1 — nt, 

l-nt\ x> l_ nt . 



thus 1 — nt n (x) = min((l — nt)/x, 1). In this case Lemma l5.4f ii) 
implies (|5.14p with cr 2 (x) = c k x 2k ~ 3 , where c k > is the leading 
coefficient in Q k . 

In all cases, the same calculation yields also (|5.15p . because the factor 1 + 
0£i(l) in Lemma 15.41 is O^i(l). The result follows from the final statement 
in Proposition 15.51 □ 

Let us now consider the case k = 2. 

Theorem 5.9. If n 2 t — > oo and n(l — nt) 3 — » oo, then 

, n 2n 2 t 

S 2 (t) ~ AsN 



1 - nt' (1 - nt) 5 
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Proof. By (|32J) and (J2HD, V 2 (t) = S 2 ,2(t) = S 2 (t) 2 - 5 4 (t), and thus Q 
yields 

y 2 (t) = (1 - nt) 2 F 2 (t) - 2n(l - ni)S 2 (i) 

= ((1 - nt)£ 2 (i) - n) 2 - n 2 - (1 - ni) 2 S 4 (i). 
By Theorems 14.11 and 13,31 
E((l - nt)S 2 {t) - nf = (1 - nt) 2 Var(S 2 (*)) + ((1 - rat)ES 2 (t) - nf 

= o(^-^]+o(^±^]=o( " 



[1-ntfJ \{l-ntf) V(l-nt) 3 /' 

By Lemma S3 ||(1 - nt) 2 S 4 {t)\\ L i is also estimated by 0(n(l - nt)" 3 ; 
Hence, 

n 



V 2 (t) = -n 2 + L i 



.(1 - nt) 3 

We now obtain from (15. 4ft 

/** / nt \ 
S 2 {t) = M 2 (t)+n+ V 2 {u)du = M 2 {t)+n-n 2 t+0 L A- -A (5.16) 

For fe = 2, (f5JTD and (15321) yield P 2 (x) = 2x 2 g 3 (x) 2 = 2x 2 and Q 2 (x) = 
2{x — 1). Hence Lemma 15.81 yields 

M 2 (t) ~AsN(0, I -^J. (5.17) 

It is easily verified that < x ^ t yi < (t=^) 1/2 - Hence, (pU6l) and (1537) yield 

/ 2n 2 t \ 

5 2 (t)~AsN(n(l-nt), r ^j. 

Recalling the definition S 2 (t) = (1 — nt) 2 S 2 {t), we obtain the assertion. □ 



Proof of Theorem \1.3\ asymptotic normality. Immediate from Theorem [57 
by our usual relation x(G( n ,p)) = n~ 1 S 2 (— log(l — p)). □ 

For k > 2, the argument is more involved, and we will be somewhat 
sketchy. We assume 1 — nt ^> n -1 / 3 and consider first k = 3. By (|3.2j) and 
(123) . V 3 (t) = 35 2 , 3 (t) = 35 2 (t)5 3 (t) - 35 5 (t), and thus ^ yields, using 
(15.10j) , Lemmas 13.21 and 15.11 and the Cauchy-Schwarz inequality, 

V 3 (t) = (1 - ntfV 3 (t) - 3n(l - nt) 2 S 3 (t) 

= 3(1 - nt) 3 (s 2 (i) - ^-^-^(t) - 3(1 - ntfS 5 (t) 

= 3(1 - nt) 3 Y 2 (t)S 3 (t) - 3(1 - nt) 3 S 5 (t) 

= 3n(l-ntfp 3 (-^—)Y 2 (t) + L / 



1-ntJ w V(l-ni) 
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Hence, by (|5.4j) . recalling p 3 (x) = x 3 , 

S 3 {t) = M 3 (t) +n + 3nj* Y 2 {u) du + O l i ( ) , (5.18) 

where we may ignore the O term but not the integral, unlike the correspond- 
ing expression (15,161) for k = 2. We find from (15.161) 

Y 2 (u) = (l-nuy 2 (S 2 (u)-n(l-nu)) = (l-nu)- 2 M 2 (u)+0 L1 ( — ). 

\ (1 — nuy' 

Hence, ([5718]) yields 

5 3 (t) -n = M 3 (t) +3nj(l- nu)- 2 M 2 {u) du + O l x ( _"^ 3 ) • (5-19) 

We applied above Proposition 15.51 to M2, but we only used the result 
Lemma 15.81 for a single i. Now we use the full process statement of Propo- 
sition 15.51 from which we conclude (after a change of variables as in the 
proof of Lemma 15.81) that f Q (l — nu)~ 2 M 2 {u)du also has an asymptotic 
normal distribution. Moreover, by the vector-valued version of Proposi- 
tion 15.51 mentioned in Remark 15.61 the argument in the proof of Lemma 15.81 
yields joint asymptotic normality of the processes M k for different k; this 
uses a straightforward extension of Lemma 15.41 to quadratic covariations 
[MfojMfcjJt. As a result, the first two terms on the right hand side of (|5.19p 
are jointly normal, and the O term can be ignored. (The right normaliza- 
tion here is, cf. Theorem 14.11 to divide by n 2 t 1 / 2 (l — nt)~ 9 / 2 .) A careful 
but rather tedious (even with Maple) calculation of the involved covariances 
yields S 3 (t) ~ AsN(n, Q 3 (l/(l-ni)) with Q 3 (x) = 96x 3 -198x 2 + 126x-24. 
Hence, with P 3 (x) = x 6 Q 3 (x) = 96 x 9 - 198 x 8 + 126 x 7 - 24x 6 , 

S3W ~ As K(T^fMi^))- ( 5 ' 2 »> 

We can argue in the same way for k > 3 too, which leads to the recursive 
formula (for all k > 2, cf. (|5TT6j) and (IBTTHl) for k = 2 and 3) 

Y k (t) = (1 - nt)- k M k {t) + n(l - nt)- k £ ( . * \ 

3=2 ^ J ' 

x j\l - m-JWf (1^)15 W *• + °L ( jT^s) • (5-21) 
This yields, by induction, cf. (15.160 and (15.190 for k = 2 and 3, 

F fc (t) = (1 - nt)~ k M k (t) +n(l - nt)" fc V / P fcJ (- W(u)du 

z — ' Jo VI — nuJ 



J=2' 



nt 

(1 
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for some polynomials P k j(x) having degree at most k + 1 — j and no terms 
of degree < 1. The asymptotic joint normality of the processes M k (with 
a careful count of the degrees of the involved polynomials) now shows the 
following extension of Theorem 15.91 and (|5.20p . 

Theorem 5.10. There exist polynomials Pk{x) of degree (at most) 2k — 3 
such that if 1 — nt 2> n -1 / 3 , then 

S k (t)~AsN(np k (—^),P k (—!—)), k>2. 



.1 — nt/ v 1 — nt/ 
Furthermore, this holds jointly for all k > 2, with asymptotic covariances 
given by polynomials P k) i{x) of degree (at most) 2k + 21 — 3. 

We have, for example, P 2 (x) = 2x 5 , P 3 {x) = 96x 9 -198x 8 + 126x 7 -24x 6 
(as said above), and P 2 ,3(x) = 12 x 7 — 18x 6 + 6x 5 . To find P k = P kt p. and P k j 
in general by this method seems quite difficult, although it is in principle 
possible using computer algebra. In the next section we will, by a different 
method, find the asymptotics of the covariances of the variables Sk{t). It is 
natural to conjecture that these coincide with the asymptotic covariances in 
Theorem 15.101 which be general probability theory, e.g. 0, Theorem 5.5.9], 
is equivalent to uniform square integrability of each of the standardized 
variables (Sk(t) — ~ESk(t))/ Var(5fc(t)) 1//2 as n — > oo. This is very plausible 
(and thus verified for k = 2 and 3 by our calculations of P 2 and P3), but we 
have so far been unable to verify it in general, and we leave this as an open 
problem and conjecture. (It would suffice to consider the case nt < 1/2, say, 
and show for example that then E \Sk(t) — ES , / c (t)| 4 = 0(n 2 ).) 



Conjecture 5.11. P k i equals the polynomial P k i defined in (|6.1 



Remark 5.12. The purpose of introducing Sf. in (|5.2p is that if we argued 
directly with S k and M k , we would obtain an equation similar to (|5.21|) . but 
with Yk(u) in one of the integrals on the right hand side. Thus, to derive the 
asymptotic normality of Yd) from the asymptotic normality of the processes 
Mk, we would have to invert a Volterra equation (also for k = 2). This is 
effectively what we do by introducing S^- 

6. The variance again 

In Theorem 14.11 we gave a simple upper of the variance for the variance 
of Sk(t). We shall now, using a more involved argument, find the precise 
asymptotics. 

Theorem 6.1. For every k, I > 2 and < t < 1/n, 

cov(5 l(t) , m ) = n PtJ (^l.) + o( (1 _^ )Wa ; 

where P k ,i is a polynomial of degree 2k + 21 — 3 given by 

r> r \ 1 \ Pk+i(x)pi +1 (x) 

Pk,i(x) = Pk+i{x) • (6.1) 
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P2,2(x) = 2x 5 -2x 4 

P 3y3 (x) = 96 x 9 - 198 x 8 + 126 x 7 - 24 x 6 

P 4 , 4 (x) = 10170 x 13 - 34050 x 12 + 43520 x 11 - 26192 x 10 + 7272 x 9 - 720 x 8 

p 3 2 (x) = 12 x 7 - 18 x 6 + 6 x 5 

P 4)2 (x) = 90 x 9 - 190 x 8 + 124 x 7 - 24 x 6 

P 4)3 (x) = 900 x 11 - 2430 x 10 + 2322 x 9 - 912 x 8 + 120 x 7 

Table 2. The polynomials Pk,i(x) for k,l < 4. 

Some polynomials P k> i are given in Table EJ In particular, -P2, 2(1/2/ ) = 
2(1 — y)/y 5 and thus 

v "< & < f » = (T^( 1 + G^^)> <•*> 

For 1 — nt < n -1 / 3 , Theorem 16.11 is a trivial (and uninteresting) conse- 
quence of Theorem 14.11 and the Cauchy-Schwarz inequality, so we assume 
in the sequel that 1 — nt > n -1 / 3 . We precede the proof by several lemmas; 
we begin by defining, extending (12. ip . 



Sk!,...,k m (G) := \Ch\ kl " " " |Cj,„| fcm , 

where ^ denotes the sum over distinct indices only. Then, cf. (|2.ip . 
Sk u ...,k m (G) = Sk lt „.,k m - 1 (G)S km (G) 

- Ski+k m ,-,k m -i(G) Ski,...,k m -i+k m (G), (6.3) 

where we subtract m — 1 terms with /c m added to one of k%, . . . , k m -i. 
For G = Q(n,t) we write S^... .,&,„(£) and have the following estimate, cf. 
Lemma 13.11 

Lemma 6.2. For each k±, . . . , k m and 1 — nt > n _1 / 3 ; 

ES * = - (1^5) (» + 

Proof. Immediate by Theorem 13.41 (j6.3[) and induction over m. □ 
We write Sk(t;n) when needed to show the number of vertices explicitly. 
Lemma 6.3. For each k > 2 and 1 — nt > re -1 / 3 , 

ES k (t; n + 1) - ES k (t; n) = P^^) + o( (1 _ ^ )2fc+1 ) , (6-4) 

where p* k is a polynomial of degree 2k — 2 given by 

p* k (x) := p k {x) + (x 2 - x)p' fc (x) = x" 1 p k +i{x). (6.5) 
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The formula (|6.4p is, not surprisingly, essentially what a formal differen- 
tiation of (|3.17l) with respect to n would give. 

Proof. Let Q(n,t) have the components Ci, ... ,Ck- Add a new vertex and 
add edges to it with the correct probabilities, and let AS k := <Sfc(£; n + 1) — 
Sk(t;n) be the resulting increase of Sk(t). Let Jj be the indicator of the 
event that there is an edge between the new vertex and Cj. Then 

AS 2 = 1 + 2 \ c i\ J i + 1^2 2\Ci\\Cj\ J i J ji 

i i,j 

A5 3 = 1 + £(3|Ci| + 3|C,| 2 )J, + X - Y^{m\ 2 \Cj\ + 3|C i ||C i | 2 + 6]^])^ 

« hi 

+ 6|Cj||Cj||Cfc| JiJj Jfc, 

and so on. Given the components C\, C 2 , ■ ■ ■ , the indicators Ji are indepen- 
dent with E Ji = 1 - e- |Cl|t = |Cj|i + 0(|Ci| 2 i 2 ). Hence, for k = 2, using 
|Cj|t < nt < 1 to simplify terms like \Ci\ 2 t 2 \Cj\ 2 t 2 , 

E(AS 2 \G(n,t)) = l + 2tS 2 (t) + 0(t 2 S 3 (t)) + t 2 S 2 , 2 (t) + 0(t 3 S 3 , 2 (t)). 

Taking the expectation we find, using Lemma 16.21 

E A5 2 = 1 + 2nt P2 ( Y ±- i ) + (-ft*^)' + 0((r^)' («- 6 ) 

The same argument applies to every /c, and yields an expression for EASfc 
where the main terms are of the type c(ni) m pfc 1+ i • • ■ Pk m +ijt^i, where c is 
a positive combinatorial constant, < m < k, 1 < k{ < k — 1 and fcj < k; 
the error terms are all 0(l/(n(l — nt) 3 ) of some such terms. The main terms 
are polynomials in 1/(1 — nt) of degree ^,A2ki — 1) <2k — 2, so the result 
can be written as ()6.4[) for some polynomial 

To identify p* k , fix y € (0, 1/2) and a rational e G (0, 1), consider only n 
such that en is an integer and let t = y/n and repeat (16. 4ft en times. This 
yields 

E ( S fe (t;(l + £)n)-E5 jfc (i;n)=£n(p|( r ^) + 0(e)) + 0(e), 
and thus, by Theorem 13.41 

(1 + e)n Pk ( 1 _ ( l +e)y ) ~ n Pk (^) = en pl (_L) + 0(e 2 n) + O(l). 
Divide by n and let n — > 00; this gives 

^(r^) = < 1+ *(rr(^;)-»(r^)+^ 2 )- 

Divide by e and let e — > 0; this gives, with aj = 1/(1 — y), 

Pk( x ) = Pk{x) + j- —vz P'k( x ) = Pk(x) + (x 2 - x)p' k (x). 
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The final identification of this as x p k +\(x) follows by (|7.8p proved in 
Section below. Alternatively, the proof of Theorem 16.11 below and the 
symmetry of Cov(S k ,Si) shows that p k+i - Pk+ipf = Pk+i - Pl+iP* k , and 
thus, choosing I = 2, p* k = p k +i ■ P2/P3, which yields the formula, since it 
follows from (]6.6[) that p^(x) = x 2 . (This thus gives an alternative proof of 

(EHD.) " □ 



Proof of Theorem \6.1\ Let A n and lA(t) be as in the proof of Lemma 13.11 
Conditioned on lA(t) = 1, the complement of A is a random graph equivalent 
to Q(n - \A\,t). Thus, 

Cav(S k (t),Si(t)) = e(J] \ A \ kl A(t) E \B\ l I B (tj)-ES k (t)ESi(t) 

AeA n BeA„ 

= E l^| fc+ ^(t)+IE £ |A| fe / j4 (t)( E |5| J / B (t)-E5K^ 
Ag,4„ AeAn BnA=(& 

= ES k+l (t) + E Y, |A| fc / A (t)(E^(t;n-|A|)-E5 ; (t;n)). 
AeAn 

By Lemma |6.3[ for some £ [0, 1], 
E5,(*;n)-E5,(n;n-|A|) = |A|p*f__L- r _) + of 



l-nt + 0|A|i/ \(l-nt) 2l+1 



A-ntJ V(l-nt) 2 '- 1 / V(l -nt) 2 ^ 1 )' 
Consequently, 



Cav(S*(t),S z (t)) =E5 fe+ i(i) -E%(t)p, 



.1 - nt 

+ ({idg** E Sl+2(t) ) + °( ird<pw E s *+> 

and the result follows by Theorem 13.41 □ 

In the case nt — ► 1, only the leading term of is significant in Theo- 
rem 16.11 Since the leading term of p k is (2k — 5)!!x 2fc_3 , as follows by (|7.8|) 
in Section [71 we have the following corollary. 

Corollary 6.4. For every k, I > 2, ifnt—^1 with 1 — nt S> n -1 / 3 , then 

Cov(S k (t),St(t)) ~ c fc ,jn(l - nt) 3 " 2fc - a , 
with c k} i := (2k + 21- 5)!! - (2Jfe - 3)!! (21 - 3)!!. 
In particular, under these conditions, 

Var (S 2 (t)) - 2n(l-nt)- 5 , 
Var(£ 3 (t)) ~ 96n(l -nt)~ 9 , 
Var(S 4 (t)) - 10170 n(l - nt) -11 , 

cf. Table [2]and dO>. 
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Proof of Theorem \1.3\ asymptotic variance. Immediate from Theorem 15. 9| 
see (IQl. □ 



7. The Borel distribution 
Let T(z) be the tree function 

OO — \ j 

Tf \ J Z | . -1 

and recall the well-known formulas T(z)e~ T ^ = z (\z\ < e~ 1 ), T(ae~ a ) = a 
(0 < a < 1), and 

T'(z) = . T{ - Z \ . . (7.1) 
z{l-T(z)) 

A random variable B\ has the Borel distribution Bo(A) with parameter 
A e [0, 1] if 

j! T(Ae ) ;! 

(7.2) 

The probability generating function of the Borel distribution is thus 

-^=£^ = 0^ = ^ = ^. (T.3) 

It is well-known that Bo (A) is the distribution of the total progeny of a 
Galton- Watson branching process where each individual has Po(A) children; 
for this and related results, see e.g. @; H; H S3; B S ESQ- 



Now consider G(n,p) with p = X/n for a fixed A < 1, and let C v be the 
component containing a fixed vertex v. It is easily seen that as n — > oo, for 
every fixed j > 1, P(|C„| = j) — > IP(-E>a = j) given by (|7.2j) . either by the usual 
branching process approximation and the result just quoted, or by a direct 
estimation of the probability, using Cayley's formula for the number of trees 
of order j and the fact that w.h.p. the component C v is a tree. In other words, 

\C V \ B\. For any integer m, the moment E \C v \ m = E S m+ i(G(n, p)) / n, 
and Theorem 13.41 shows, with t = — log(l — p) and thus nt — > A, that 

E5 m+ i(G(n,A/n)) / 1 

= ► Prra+l I 



n V 1 — A/ 

Since thus \C V \ converges in distribution and all moments converge (to fi- 
nite limits), the moments have to converge to the moments of the limit 
distribution. We have thus shown the following. 

Theorem 7.1. The polynomials pk describe the moments of the Borel dis- 
tribution Bo(A) by the formula 

EB^ =Pm+1 (_L_), m >l. 
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For example, as is well-known, ¥,B\ = (1 — A) -1 and E5? = (1 — A)~ 3 . 

Remark 7.2. By Theorem 17. H Corollary 14.21 can be written 

S k (t) ~ p JiEB^" 1 , 1 - nt > n- 1 / 3 . 

This is not surprising since we have Sk(G) = J2 V \^v\ j an d we expect 
only a weak dependence between the components C v in this range, so this is 
a kind of law of large numbers. 

Let, cf. (17. 3ft . for \t\ small enough, 

#;A)=Ee fB * = V^E5 A m = 1 , (7-4) 

m=0 

be the moment generating function of ~ Bo(A). The moments of B\ can 
be obtained by differentiation of ip(t; A) at t = 0. 

Lemma 7.3. For each m > there exists a polynomial r m such that 

-,&(*; A) = -A— l Tm ( . (7.5) 



di™ rv ' ' A VI — T(Ae~ A e*) 

We /iaue ro(x) = 1, ro(x) = x, and 

r m +i(x) = xr m (x) + (x 3 - x 2 )r' m (x), m > 0. (7.6) 

Proof. For m = 0, ([73]) is just flZgp . 

Suppose that (|7.5p holds for some m > 0. Then, by the chain rule and 
(I7TD . withT = T(Ae- A e*), 

d m+1 ^ _ d (T T 
dpn+lW'** ~ dT { X rm \l-Tj) ' 1-T 

IT / 1 \ T 2 , / 1 



\l-T \1-TJ A(l-T) 3 m \l-T 

_ T f 1 / 1 \ / 1 1 

A \ 1 - V 1 - Tv V(l-T) 3 (l-T) 2 J' m \l-T 

which verifies (17. 5j) for m+ 1 with r m +i given by (17.61) . □ 

Since r*i (x) = x, it follows from (|7.6p by induction that r m has degree 
2m — 1 for m > 1. 

Setting t = in (f75|) yields 

Consequently, Theorem 17.11 shows that 

r m (x) =p m+ i(z), m > 1. (7.7) 
In particular, (|7.6p yields the simple linear recursion 

Pfc+ifc) = xp k (x) + (x 3 - x 2 )p' k (x), k>2. (7.8) 
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It is evident from (|7.8f) and induction that, for k > 2, the leading term of 
Pk is (2k — 5)!! x 2k ~ 3 (with the standard interpretation (— 1)!! = 1) and that 
for k > 3, the lowest order non-zero term is (-l)* _1 (fc- 2)! x k , see Table HJ 

Remark 7.4. The quadratic recursion (|3. 19|) can be seen to be equivalent 
to the quadratic partial differential equation 

— V(t;A) = (^;A)-l)-^;A), 



while the linear recursion (|7.8p is equivalent to the linear partial differential 
equation 

^(t;A) = r - I ^;A) + r - I ^(t;A). 
Remark 7.5. By Theorem 17.11 the recursion (|3.19p can be written 



or, if i? A and B" are independent copies of B\, using -j^F(B\ = j) 



dA 

_ ; ) tw r?. — ; > r,., ,,,, roi 



'irl _ j) p( J g A = j) from (TT2D . 



oo 

= £>(£? A = j). (2/ + 2j 

3=1 

= X>(£a = j) 2{j ~ lh 



which is equivalent to the well-known formula 



' X + B'{ = j) = P(B X = j) = 2-^V- 2 e-J\ j > 2; 



see e.g. [20; [l9|; |lfj|; [l3j and note that B' x + B" can be seen as the total 
progeny of a Galton-Watson process with Po(A) offspring started with 2 
individuals, or as the limit distribution of \C V U C w \ if C v and C w are the 
components containg two given vertices in G(n, X/n). 

Remark 7.6. The cumulants x m of the Borel distribution Bo(A) are the 
Taylor coefficients of log ip(t; A) at t = (times ml). Since T(z) = ze T ^ z \ 
(H3D yields 

logV(t; A) = T(Ae"V) - A + t = Xip(t; X) - A + t, 

and thus 

d m / 1 \ 

x m (B\) = — log A) | t=0 = AEB A m = A Pra+ i(— ), m > 2, 

while, of course, *c\(B\) =EB^ = (1- A) -1 . 



21 



SVANTE JANSON AND MALWINA J. LUCZAK 



We can interpret the asymptotic covariances and the polynomials i in 
Section [6] by introducing the size-biased Borel distribution B\ defined by 

HBx = j) = JF( 5r =J) = (1 " X) 3 ^- l e-i\ (7.9) 
EB X j\ 

Then 

EB A m = EB™ +1 /EBA = (l-A) Pm+2 ( r ^), m > 0, (7.10) 

and thus, by (|6. 1 1) . 

^(l^)"!^ -^ 1 ,^ 1 ). (7.1!) 

Hence, by Theorem 16. 1\ the random variables n _1//2 (l — nt) 1 / 2 Sk(t), k > 2, 
have asymptotically the same covariance structure as B^ 1 . 



Appendix A. The supercritical case 

Consider G(n,p) with np - 1 > ra -1 / 3 . It is well-known, see e.g. (Til . 
Chapter 5], that w.h.p. G(n,p) has a unique giant component. More pre- 
cisely, there is a deterministic function p > on (1, oo) such that, if the 
components C\,C%,... of G(n,p) are ordered with \C%\ > IC2I >■■■■, then 
\C\\ ~ p np{np) » n 2 / 3 , while |C 2 1 = o p (re 2 / 3 ). The function p(A) is the 
survival probability of a Galton- Watson branching process with Po(A) off- 
spring, and is given by the equation 

p(\) = l-e-W, (A.l) 

The largest component is thus much larger than the others, and it turns 
out that it dominates all other terms in the sums Sfc. We write in this 
appendix Sk{n,p) for Sk(G(n,p)), and continue to let C\ denote the largest 
component of G(n,p). 

Theorem A.l. If np — 1> n -1 / 3 , then for every k > 2, 

5*(n,p) = |Ci| fc + P ( ( ^_ re i)2fc _ 3 ) ~p ~p ("/'N)" 

In particular, then x{G{n,p)) ~ p np(np) 2 . We first prove a technical 
lemma. 

Lemma A. 2. There exists a function a : (l,oo) — ► (0,1) such that the 
following holds, for some c > 0: 

(i) For any p = p(n) with np — 1 ^> n -1 / 3 , w.h.p. \C\\ > a(np)n. 

(ii) If 1< A < 2, then A(l - a(A)) < 1 - c(A - 1). 

(iii) If A > 2, i/ien A(l - a(A)) < 1 - a 

(iv) For eac/t m > 0, 1 - a(A) = 0(A~ m ). 
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Proof. For any fixed M > 1, we can take a(X) 
if s is sufficiently small. This choice satisfies 



(l-s)p(A) for 1< A < M, 
in this range (iv) is trivial, 



and it is easily seen that (ii) and (hi) follow (provided e is small enough) 
from the facts that p(X) ~2(A-1) and A(l-p(A)) = 1 - (A - 1) + 0(A - l) 2 
as A \ 1, and A(l — p(A)) < 1 for A > 1. (All three are easily verified by 
writing (|A.1|) as A = — log(l — p)/p.) 

For large A, we argue as follows. Take 7 < p(2). Thus, w.h.p. G(n,2/n) 
has a giant component of order at least jn. For A = np > 2, construct 
G(n,p) by the usual two-round method: first take G(n,2/n) and then add 
further edges independently in a second round with probabilities p — 2/n 
(or, to be precise, (np — 2)/(n — 2) > p — 2/n). If we obtain a component 
of order at least 771 in the first round, then the probability that a given 
vertex will not be joined to this component in the second round is less than 
exp(— yn(p — 2/n)) = exp(27 — 7A). Hence, w.h.p. the number of such 
vertices is less than nexp(27 — 7A/2); for A = 0(1) by concentration of the 
binomial distribution and for A — > 00 by Markov's inequality. Consequently, 
there is w.h.p. a component with more than n — nexp(27 — 7A/2) vertices; 

1 — exp(27 — 7A/2). This a satisfies (iv) too, 
We thus can use this a for A > M for some 

□ 



hence (i) holds with a(\) 
and (hi) for large enough A. 
large M, and the first construction for smaller A. 



Proof of Theorem \A.l[ Let a = a(np) be as in Lemma IA.21 and use the 
notation of the proof of Lemma 13.11 Let N be the number of components 
of size > an in G(n,p) (thus w.h.p. A > 1 by Lemma IA.2j(i)p , and let 

Zk:= E E \ B \ kI Ais. 

\A\>an BnA=<$ 

Then 

EZ k :=E lA^S k (n-\A\,p) <ENES k (n-\an\,p). (A.2) 

\A\>an 

If 1 < np < 2, then by Lemma IA.2|(ii)[ (n — [an])p < np(l — a) < 
1 — c(np — 1), and thus by Lemma 13.21 

n — \an\ 



ES k (n - \an],p) = O 



(np — l) 2fc 3 



O 



n 



(np — l) 2k 3 



If instead np > 2, then by Lemma lA.2jlfiii) (n— \an\)p < np(\ — a) < 1 — c, 
and thus by Lemmas 13.21 and lA.^ Try) with m = 2k — 3, 



E S k (n — \an],p) = 0(n — [cm]) = O 



n 



(np) 



2k-3 



Hence, for all np, 

ES k (n-\an},p)=0 ^ np _ n i)2k _ 3 ^=o(n 2k / 3 ). (A.3) 
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Note first that Z k > N(N - l)a k . Hence, by fA~2j) and (TO|) . 

E N(N - 1) < oT fc E Z k < o(E iYn 2fc/3 a" fe ) = o(E JV). 

Since N < 1 + JV(JV- 1), it follows that EN(N- 1) = o(l) andEiV = O(l); 
hence COl) and <pO|> yield EZ fe = 0(n/(np - l) 2k ~ 3 ). By Lemma 
w.h.p. |Ci| > an; in this case, |Ci| fc < Sk(n,p) < \C\\ k + Z k , and the result 
follows. □ 

Appendix B. The critical case 

The critical case is np = 1 + 0(n -1 / 3 ). By considering subsequences, it 
suffices to consider the case n 1//3 (np — 1) — > r for some r € (— oo, oo), i.e., 
np = 1 + (r + o(l))n -1 / 3 . 

We continue to use the notations of Appendix O It is well-known that 
in the critical case, \C\\ is of the order n 2 / 3 , in the sense that |Ci|/n 2 / 3 
converges in distribution to some non-degenerate random variable, and the 
same holds for \C%\, \Cs\, . . .Moreover, Aldous [l| has shown that, with no- 
tations as in Appendix |Aj the sequence (ra~ 2 / 3 \d | , n~ 2 / 3 \C 2 1 , . . . ) (extended 
by an infinite number of O's) converges in distribution to a certain random 
sequence (C r (l), C r (2), . . . ) that can be described as the sequence of excur- 
sion lengths of a certain reflecting Brownian motion with inhomogeneous 
drift (depending on r) that is defined in The convergence is in the £ 2 - 
topology, and thus immediately implies convergence of the sums of squares. 
Moreover, convergence in £ 2 implies convergence in & for every k > 2, and 
thus we also have convergence of the sums of kth. powers. Consequently, 

Theorem B.l. If np = 1 + (r + o(l))n -1 / 3 with — oo < r < oo, then for 
every k > 2, 

n- 2k / 3 S k (n,p) W k :=Y,C T {i) k . 

i 

Note that we here have limits that are non-degenerate random vari- 
ables and not constants, unlike the subcritical and supercritical cases where 
~p a n f° r a suitable sequence a n . 

Remark B.2. Janson and Spencer [13] give a related description of the 
limit of the component sizes as a point process on (0, oo). It follows 
that we also have W k = f™x k d~ ( - T \x), and thus EW k = f^x k dA ( - T \x), 
where A( T ) is the intensity of H^ 1- ) given in [13 . Theorem 4.1]. 
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